A Microtext Corpus for Persuasion Detection in Dialog
Abstract
Automatic detection of persuasion is essential for machine interaction on the social web. To facilitate automated persuasion detection, we present a novel microtext corpus derived from hostage negotiation transcripts as well as a detailed manual (codebook) for persuasion annotation. Our corpus, called the NPS Persuasion Corpus, consists of 37 transcripts from four sets of hostage negotiation transcriptions. Each utterance in the corpus is hand-annotated with one of nine categories of persuasion based on Cialdini’s model: reciprocity, commitment, consistency, liking, authority, social proof, scarcity, other, and not persuasive. Initial results using three supervised learning algorithms (Naïve Bayes, Maximum Entropy, and Support Vector Machines) combined with gappy and orthogonal sparse bigram feature expansion techniques show that the annotation process did capture machine-learnable features of persuasion, with F-scores better than baseline.
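The abstract names gappy bigrams and orthogonal sparse bigrams as feature expansion techniques but does not define them. The following is a minimal sketch under a common reading of those terms, not the authors' implementation: a gappy bigram pairs two words that are at most `max_gap` tokens apart and ignores the distance, while an orthogonal sparse bigram records the same pair together with the number of skipped tokens. The function names, the `max_gap` parameter, the underscore-joined feature strings, and the sample utterance are all assumptions made for illustration.

```python
from typing import List


def gappy_bigrams(tokens: List[str], max_gap: int = 2) -> List[str]:
    """Pair each token with the tokens that follow it, skipping up to
    max_gap tokens in between. The gap size is NOT encoded (assumed reading)."""
    feats = []
    for i, left in enumerate(tokens):
        for gap in range(max_gap + 1):
            j = i + 1 + gap
            if j >= len(tokens):
                break
            feats.append(f"{left}_{tokens[j]}")
    return feats


def orthogonal_sparse_bigrams(tokens: List[str], max_gap: int = 2) -> List[str]:
    """Same word pairs as gappy_bigrams, but each feature also records
    how many tokens were skipped between the two words (assumed reading)."""
    feats = []
    for i, left in enumerate(tokens):
        for gap in range(max_gap + 1):
            j = i + 1 + gap
            if j >= len(tokens):
                break
            feats.append(f"{left}_{gap}_{tokens[j]}")
    return feats


if __name__ == "__main__":
    # Hypothetical utterance, not taken from the corpus.
    utterance = ["i", "want", "to", "help"]
    print(gappy_bigrams(utterance, max_gap=1))
    print(orthogonal_sparse_bigrams(utterance, max_gap=1))
```

Either feature list could then be fed, alongside plain unigrams, to a bag-of-features classifier such as the three learners named in the abstract.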
Similar resources
Segmenting Chinese Microtext: Joint Informal-Word Detection and Segmentation with Neural Networks
State-of-the-art Chinese word segmentation systems typically exploit supervised models trained on a standard manually annotated corpus, achieving performance above 95% on a similar standard testing corpus. However, performance may drop significantly when the same models are applied to Chinese microtext. One major challenge is the issue of informal words in the microtext. Previous studies...
Construction and Analysis of a Persuasive Dialogue Corpus
Persuasive dialogue systems, systems which are not passive actors, but actually try to change the thoughts or actions of dialogue participants, have gained some interest in recent dialogue literature. In order to construct more effective persuasive dialogue systems, it is important to understand how the system’s human counterparts perform persuasion. In this paper, we describe the construction ...
A CCG-Based Approach to Fine-Grained Sentiment Analysis in Microtext
In this paper, we present a Combinatory Categorial Grammar (CCG) based approach to the classification of emotion in microtext. We develop a method that makes use of the notion put forward by Ortony, Clore, and Collins (1988) that emotions are valenced reactions. This hypothesis is central to our system, in which we adapt contextual valence shifters to infer the emotional content of a text. W...
Reports of the 2013 AAAI Spring Symposium Series
Much progress has been made in recent years in several areas within natural language processing. However, so far there has been less work related to microtext (for example, instant messaging, transcribed speech, and microblogs such as Twitter and Facebook). Microtext is made up of semistructured pieces of text that are distinguished by their brevity, informality, idiosyncratic lexicon, nonstand...
A Comparison between Microblog Corpus and Balanced Corpus from Linguistic and Sentimental Perspectives
While microblogging has gained popularity on the Internet, analyzing and processing short messages has become a challenging task in natural language processing. This paper analyzes the differences between Internet short messages (or “microtext”) and general articles by comparing the Plurk Corpus and the Sinica Balanced Corpus. Likelihood ratio and the tóngyìcícílín thesaurus are adopted t...